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(57) Abstract: A device for and method of calibrating a microphone, comprising a loudspeaker (3) for converting a loudspeaker 
input signal (5) into sound; a microphone (4) for converting received sound into a microphone output signal (16), and calibration 
means for calibrating an output power of the microphone relative to a desired power leveL The calibration means comprise impulse 
response estimating means (7) for estimating an acoustic impulse response of the microphone by correlating the microphone output 
signal (6) and the loudspeaker input signal (5) when the microphone (4) receives the sound firom the loudspeaker (3). whereby the 
output power of the microphone (4) is estimated. 
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Device and method for calibration of a microphone 



The present invention relates to microphone output signal levels and more 
specifically to the calibration thereof to a desired level. When output levels of different 
microphones are compared, it is assumed that the acoustical excitations thereof are identical. 
Manufacturers supply microphones having output levels varying aroimd a specified mean 
5 value. For the often used back-electret microphones, such tolerances are ±4 dB. 

Consequently, the output levels of such microphones may show a difference of up to 8 dB. 
Microphones with tolerances of ±2 dB are sometimes available. These, however, are more 
expensive. 

A usual approach for gain calibration of a microphone is carried out in an 
10 anechoic chamber, i.e. a chamber without reflections or reverberation. A loudspeaker is 
placed in firont of the microphone (at an angle of 0**) inside the anechoic chamber. The 
loudspeaker plays a noise sequence at a known power level and the power of the microphone 
response is measured. Subsequently, an adjustable gain is set. 

15 

Further an audio processing arrangement is disclosed in patent application WO 

99/27522. According to this prior art reference, filtered sum and weighted sum beamforming 

are developed for maximizing power at the output. Filtered sum beamforming (FSB) makes 

the direct contributions maximally coherent upon adding thereof. 
20 With multimicrophone algorithms such as beamforming, it is very important to 

sort the microphones during production to obtain sets with level dilEferences within the 

required tolerances. 

Moreover, with some multi-microphones systems, the consumer may buy 

additional microphones later in time, which will also have to be calibrated before installation. 
25 The present invention provides a device for calibration of a microphone, 

comprising: 

- a loudspeaker for converting a loudspeaker input signal into sound; 

- a microphone for converting received sound into a microphone output signal. 
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- calibration means for calibrating the output power of the microphone relative 
to a desired power level, said calibration means comprising impulse response estimating 
means for estimating an impulse impulse response of the loudspeaker and/or the environment 
at the microphone of the microphone by correlating the microphone output signal and the 
5 loudspeaker input signal when the microphone receives sound from the loudspeaker, hereby 
the output power of the microphone is estimated. 

As indicated above, calibration of microphones is often of crucial importance 
for good performance of multimicrophone systems. The present invention is concerned with 
the adaptive calibration (in software) of microphones imder reverberant room conditions. An 
10 advantage of the present invention is that the microphones need not be selected or calibrated 
when manufacturing an audio system, saving production time and sometimes additional 
hardware. The present invention can be applied in all speech commxmication systems vrfiere 
one or more microphones and a loudspeaker are available. One can tliink of handsfree 
telecommunication systems, but also of handsfree speech recognition systems for voice 
IS control of e.g. a television set. 

Non-uniformly ageing of microphones which can also lead to output level 
*, differences will also be neutralized by this invention. 

j In a preferred embodiment of the invention, direct.part removal means are 

provided for removing the direct part of the so called acoustic impulse response (a.i.r.) in 
20 order to use especially the diffuse part of the a.i.r.. An advantage hereof is, that calibration can 
be executed during use in a normal environment, e.g. a room of a microphone and without 
the need for adding hardware being added. Calibration during the actual use also allows for 
either absolute calibration or relative calibration. 

Another preferred embodiment comprises high and low pass filter means for 
25 filtering low and high frequencies, allowing for better calibration by using frequency ranges 
where signal quality is best suitable for processing. 

Another preferred embodiment comprises squaring and simmaation means for 
creating a representation of the current power level of the diffiise soundfield response of the 
microphone in order to create a value that can be related to a desired level. 
30 The invention further preferably comprises relating means for relating the 

power level of the (diffuse) microphone response with a desired power level. 

Although it may be possible to obtain an absolute value for the desired power 
level, this desired power level is preferably available from a reference microphone. 
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Further advantages, features, and details of the present invention will become 
clear when reading the following description with reference to the aimexed drawings, in 
which: 

5 

Fig. 1 is a perspective and partly diagrammatic view of a preferred 
embodiment of present invention in an audio conferencing system; 

Fig. 2 is a diagram of a prior art setting for calibration of a microphone in an 
anechoic chamber; 

10 Fig. 3 are graphs of a typical a.i.r. at 0° of a microphone and a corresponding 

energy decay curve (e.d.c.) as a function of time; 

Fig. 4 are graphs of a typical a.i.r. at 1 80° on the same microphone as in Fig. 3 
and the corresponding decay curve (e.d.c.) as a function of time; 

Fig. 5 is a diagram of adaptive microphone calibration as included in the 
1 5 embodiment of Fig. 1 ; 

Fig. 6 is a diagram of adaptive microphone calibration relative to a reference 
j microphone which can also be used in the embodiment of Fig. 1 ; 

' Fig. 7 is a diagram of relative calibration relative to reference microphone 

which can be also be used in the embodiment of Fig. 1; and 
20 Fig. 8 is a diagram of a band pass filter and subsequent squaring and 

summation operation for use in the diagrams of Figs. 5-7. 



Fig. 1 shows an audio conferencing system. It comprises a main console 1 and 
25 one or two satellite microphones 2 for a larger pick-up range of speech, which each contain a 
microphone, and is connected to a floor unit 23, which is connected to a power source 24 and 
a telephone network 25 of some kind, e.g. a PSTN (RJl 1) or an ISDN (RJ45). The main 
console comprises, a loudspeaker for producing (voice) sounds, and three nucrophones for 
picking up (voice) sound. Furthermore, telephone means are comprised for making contact to 
30 other telephones through a telephone network. Hie microphones preferably inter-operate as 
seamlessly as possible. For this purpose, the invoition provides means in order to allow for 
the abandonment of pre-installation calibration of the microphones in the satelli- 
temicrophones or even microphones in the main console. 
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Another example of use of a device according to present invention (not 
shown) relates to voice based commanding of a television set e.g. for switching channels or 
controlling the volume, by using microphone input This can also be embodied in a form with 
, one or several microphones. In order for a system to use the microphone ou^ut signal, 
5 calibration can be necessary. 

For clarification some acoustical concepts are explained that are relevant for 
understanding the detailed description of the drawings. In Fig. 2, a loudspeaker 3 and a 
microphone 4 aiming towards that loudspeaker (thus at 0**) inside a room are shown. 

An acoustic impulse response (a.i.r.) can be estimated from the loudspeaker 
10 excitation signal and the microphone response by correlation techniques. An a.i.r. is the 
response on an impulsive acoustic excitation. An example of such an estimated a.i.r. is 
depicted in Fig. 3. During the first few milUseconds the response is zero due to the delay 
from the limited sound speed in air. Next, a large peak can be observed, which is due to the 
response to the direct acoustic propagation of the sound from the speaker towards the 
15 microphone, and is called the direct sound field contribution. This peak has a normalized 

value of 1 .0. The tail relates to this value as depicted in this graph. The tail of the a.i.r. is due 
- to reflections against room boundaries, and is called the difluse sound field contribution. 
These reflections have a random character and iuCTease statistically in density and decrease 
exponentially in amplitude in time. The combined efiEects of the reflections are called rever- 
20 beration. 

An important Amotion of the ai.r. is the energy decay. In discrete time, with n 
-i the sample index, the energy decay at index n amounts to the energy left in the tail of the 
J a.i.r.. In Fig. 3 the so*<:alled energy decay curve (e.d.c.) corresponding to a.i.r. is also loga- 
rithmically plotted. On the Y-axis the quantity is measured in dB. The e.d.c, shows an abrupt 
25 change due to the direct component. The difference in energy decay just before and just after 
this jiunp is called the clarity index. A larger clarity index implies a larger direct/difliise ratio 
and thus less reverberation. The envelope of the diffuse tail of the a.i.r, has an exponential 
decay which leads to the constant slope of the logarithm of the tail of the e.d.c. The rever- 
beration time T60 is the time interval in which the reverberation level drops down by 60 dB. 
30 It is found for this case that T60 = 0.36 s. 

Microphones can have imidirectional beam patterns. Unidirectional 
microphones only pick up acoustic signals from a certain range of angles around 0°; they 
more or less block acoustic signals arriving at 180°. This means that the direct field 
contribution of an a.i.r. measured at 180** will be almost zero. 
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In Fig. 4 the a.i.r. and the e.d.c. of the same (unidirectional) microphone as of 
Fig. 3, but now at 180^ are plotted.There also is a value normalized to one, yet only the tail is 
shown as this represents the diffuse response. By comparing fig, 3 and Fig. 4 it appears that 
at 180° the direct contribution has vanished while the diffuse contribution has the same 
5 exponential envelope in both Figs.. 

In the following, it is assumed that the energy in the diffuse tail of the a.i.r. 
does not depend on the microphone or loudspeaker orientation and location ia the room. In 
practice some variation are foimd depending on orientation and location, but these variations 
are small when the acoustic absorption pattern in the room is more or less homogenous and 
10 the reverberation in time is not to small (T60 > 100 ms). It is worth mentioning that a typical 
room has a reverberation larger than 300ms. A general rule is that the bigger a room is the 
longer the reverberation time is. 

The present invention uses as input not only the microphone response but also 
the excitation signal of the loudspeaker (Fig. 2), First, the a.i.r. is estimated from the loud- 
1 5 speaker to the microphone using a well-known correlation method in the estimating means. 

When acoustic cancellation is performed, this adaptive filter is already available. The diffuse 
■ I part of the a.i.r. is selected in the direct part removal means. At low frequencies the 
--' loudspeaker output and/or the microphone sensitivity is low, which leads to unreliable a.i.r. 

coefficients. Therefore a high-pass filter is applied to the diffuse part of the a.i.r. at the 
20 highest frequencies, near the Nyquist frequency, the signal levels will also be low due to 

anti-aliasing filters. Thus, to deal with unreliable a.i.r. coefficients at high frequencies a low 
pass filter is applied. 

In Fig. S, these high and low pass filters are combined to a band pass filter. 
The filtered coefficients are squared and summed in the squaring and summation means, 
25 which leads to actOal power level 14 representing the current power of the diffiise 

microphone response. This power level is related to a desired power level 20 and the gain 
fitctor is determined as the square root of the quotient of these power levels. 

In the preferred embodiment this calibration method can be applied each time 
the adaptive filter comes up with a new estimation of the a.i.r. For increased robustness of an 
30 acoustic echo canceller a progranmiable filter is sometimes used (as described in US 

4,903,247). The adaptive filter runs in the background and the programmable filter, which 
takes its coefGcients conditionally from the adaptive filter, is used for the actual echo 
removal. In this case it is best to take the coefficients of the programmable filter and apply 
the caUbration procedure after each coefficient transfer. 
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The loudspeaker 3 (Fig. 5) gets a loudspeaker input signal 5. Microphone 4 
receives the sound that is being produced by the speaker 3 and transforms this into 
microphone output signal 6. Digital values of signals 5 and 6 are being fed to estimator 7. 
The estimator 7 produces estimated values 9 that pass through to direct part removal part 8 
5 embodied in software. From here digital values 10 are fed to digital band pass filters 1 1 . 
Signals 12 from these band pass filters are fed to a squaring and summation program 13. 

The estimated actual power level (P) 14 is fed to a relating program 15 as is an 
(external) desired power level (Q) 20. From here the calibration gain factor 16 is fed to the 
averaging means 17. An adjusted calibration gain factor 18 is being fed back to the 
10 microphone output signal in order to form the caUbrated signal 19. 

Especially when combined with an adaptive filter for acoustic echo can- 
cellation the proposed microphone calibration method can be applied all the time that the 
system is active. In Fig, 5 the calibration factor being the square root of the desired power 
level divided by the actual power level is averaged to ensure that successive calibration gain 
1 5 fectors will change smoothly. Such averaging can be done with a first-order recursion. This 
averaging procedure can also be applied to the actual power 14 and the desired power 20 
before the calculation of the square root of the desired power level divided by the actual 
J power level. 

Below, the process of the embodiment of Fig. 5 is described. This preferred 
20 embodiment of the present invention requires as input not only the microphone response 6 
but also the excitation signal S of the loudspeaker (Fig. 2). First, the a.i.r. is estimated from 
the loudspeaker to the microphone using a correlation method in the estimating means 7. 
Only the diffuse part of the ai.r. is selected in the direct part removal means 8. The band pass 
filter 1 1 is used for filtering out high and low frequencies. The filtered coefGcients are 
25 squared and smnmed in the squaring and summation means 13, v^ch leads to actual power 
level 14 representing the current power of the difiuse microphone response. This power level 
is related to a desired power level 20 and the gain factor is determined as the square root of 
the desired power level divided by the actual power level. 

Fig. 6 shows the same configuration as Fig. 5 except for the averaging means 
30 17 and relating program 15. This configuration is used in case of referential calibration for 
the reference microphone whereby the desired power level 20 is input for the relating means 
15 of the other microphones calibration means using the reference microphone as their j 
reference. 
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Fig. 7 shows how the building blocks of Fig. 5 and 6 can be combined for 
referential calibration for use in e.g. an audio conferencing system as in Fig. 1 . 

Fig. 8 shows graphically how tiie averaging algorithm would work in 
calculating the power P of a diffuse sound field response of a microphone. The scheme 
5 consists of a band pass filter followed by summation of the squared output values. At a 
sampling rate of 8 kHz good filter parameters leading to low-pass and high-pass cutoff 
fiiequencies (-3 dB) of about 200 Hz and 3.6 kHz, respectively, are b=0.800, al=0.128, and 
a2=0.621. 

The present invention is not limited to the above preferred embodiments; the 
10 rights applied for are defined in the aimexed claims. 
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1 . A device for calibration of a microphone, comprising: 

- a loudspeaker (3) for converting a loudspeaker input signal (S) into sound; 

- a microphone (4) for converting received sound into a microphone output 
signal (16), and 

5 - calibration means for calibrating an ou^ut power of the microphone relative 

to a desired power level, said calibration means comprising impulse response estimating 
means (7) for estimating an acoustic impulse response of the microphone by correlating the 
microphone output signal (6) and the loudspeaker input signal (5) when the microphone (4) 
receives the sound from the loudspeaker (3), whereby the output power of the microphone (4) 
10 is estimated. 

' 2. A device according to claim 1 further comprising direct part extraction means 

(8) for extracting a direct part of the acoustic impulse response. 

15 3. A device according to claim 1, further comprising high and lowpass filter 

means (11) for filtering low and high frequencies. 

4. A device according to claim 1, further comprising squaring and summation 
means (13) for creating a representation of a current power level of a diffuse microphone 

20 response. 

5. A device according to claim 1, further comprising relating means (15) for 
relating a power level (14) of the diffuse microphone response with a desired power level 
(20). 

25 

6. A device according to claim 5, in which an output (16) of the relating means 
(15) or the averaging means (17) is fed back to the microphone output signal (6) as 
calibration factor (18). 
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7. A device according to claim 5, whereby the desired power level (20) has a 
predetermined value for absolute calibration of the microphone. 

8. A device according to claim 5, comprising a reference microphone (B) for a 
5 relative calibration of one or more microphones (A) relative to the reference microphone (B) 

whereby the output of the squaring and summation means (13) of the reference microphone 
form the input for the relating means (1 S) for the other microphones. 

9. A device according to claim 3, whereby the high and low pass filter means are 
1 0 combined into a bandpass filter (11). 

10. A device according to cleiim 1, arranged for averaging a calibration factor (16) 
is averaged. 

15 11. A device according to claim 1 0, in which the averaging is performed before 

the calculation of square root of the desired power (20) divided by the actual power (14). 



A method of calibrating a microphones, using a device according to claim 1. 
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